Abstract

In this paper we study noisy reversible circuits. Noisy computation and reversible computation have 
been studied separately, and it is known that they are equivalent in power to unrestricted computation. 
We study the case where both noise and reversibility are combined and show that the combined model 
is weaker than unrestricted computation.
We consider the model of reversible computation with noise, where the value of each wire in the
circuit is flipped with some fixed probability $1/2 > p > 0$ each time step, and all the inputs to the
circuit are present at time 0. We prove that any noisy reversible circuit must have size exponential
in its depth in order to compute a function with high probability. This is tight, as we show that any
(not necessarily reversible or noise-resistant) circuit can be converted into a reversible one that is
noise-resistant with a blow-up in size which is exponential in the depth. This establishes that noisy
reversible computation has the power of the complexity class $NC^1$.

We extend the upper bound to quantum circuits, and prove that any noisy quantum circuit must 
have size exponential in its depth in order to compute a function with high probability. This
highlights the fact that current error-correction schemes for quantum computation require constant inputs
throughout the computation (and not just at time 0), and shows that this is unavoidable. As for the
lower bound, we show that quasi-polynomial noisy quantum circuits are at least as powerful as quantum
circuits with logarithmic depth (or $QNC^1$). Making these bounds tight is left open in the quantum
case.

1 Introduction

In this paper we study noisy reversible circuits. Noisy computation and reversible computation have 
been studied separately, and it is known that they are equivalent in power to unrestricted computation.
We study the case where both noise and reversibility are combined and show that the combined model
is weaker than unrestricted computation.

The model of reversible noisy computation seems natural by itself, especially when viewed as a 
model of physical computation. It is also motivated by the current surge of interest in noise in 
quantum computation, which generalizes reversible computation. Indeed, we extend our lower bounds
to the quantum case. 

Reversible Circuits 

The subject of reversible computation was first raised in connection with the question of how much
energy is required to perform a computation. In this paper we do not wish to argue about when and 

* Department of Computer Science, Hebrew University 
† Department of Computer Science, Hebrew University
‡ Dept. of Computer Science, UCSD
§ Department of Computer Science, Hebrew University 






in what ways this reversibility condition is indeed a true requirement, but rather limit ourselves to 
study models in which this requirement holds. The reader can consult, for example, Landauer [11] and
Bennett [?] for more discussion.

Our model of reversible computation will be boolean circuits which may use only reversible gates. 

Definition 1 A function $g : \{0,1\}^k \to \{0,1\}^k$ is called reversible if it is 1-1 (thus, a permutation).
A gate with k inputs and k outputs is called reversible if it computes a reversible function. 

Our circuits will be composed of reversible gates which belong to some fixed set of gates (our 
computation basis). As is usual in boolean circuits the exact choice of a base is not important as long 
as it is finite and "universal". An example of a reversible gate which, by itself, is a universal family is
the 3-input, 3-output Toffoli gate. In order to keep reversibility, we do not allow wires in the circuit 
to "split", i.e. the fanout of each output wire (or input bit) in the circuit is always 1. It follows that a 
reversible circuit has N inputs and N outputs, for some N, and that it computes a reversible function. 

In order to compute non-reversible functions by reversible circuits we apply the following convention. 

Definition 2 We say that a reversible function $F : \{0,1\}^N \to \{0,1\}^N$ implements a boolean function
$f : \{0,1\}^n \to \{0,1\}$ if for every $x \in \{0,1\}^n$, the first output bit of $F(x0)$ is $f(x)$. Here $x0$ means
padding $x$ by $N - n$ 0-bits in order to get an $N$-bit input for $F$.
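
As a small illustration of Definitions 1 and 2, the following Python sketch checks that the Toffoli gate is a permutation of $\{0,1\}^3$ and that, after relabeling its output wires so that the target bit comes first, it implements AND with one padding bit. The function names (`toffoli`, `is_reversible`, `and_circuit`, `implements`) and the wire relabeling are illustrative assumptions of ours, not constructions taken from the paper.

```python
from itertools import product

def toffoli(a, b, c):
    """Toffoli gate: flips the target c iff both controls a and b are 1."""
    return (a, b, c ^ (a & b))

def is_reversible(gate, k):
    """Definition 1: a k-bit gate is reversible iff it is 1-1 on {0,1}^k."""
    images = {gate(*bits) for bits in product((0, 1), repeat=k)}
    return len(images) == 2 ** k

def and_circuit(a, b, c):
    """Toffoli followed by a relabeling of wires so the target comes first."""
    a, b, c = toffoli(a, b, c)
    return (c, a, b)

def implements(F, f, n, N):
    """Definition 2: the first output bit of F(x padded with N-n zeros) is f(x)."""
    for x in product((0, 1), repeat=n):
        padded = x + (0,) * (N - n)
        if F(*padded)[0] != f(*x):
            return False
    return True

print(is_reversible(toffoli, 3))
print(implements(and_circuit, lambda a, b: a & b, n=2, N=3))
```

Relabeling wires is itself a permutation of the bits, so `and_circuit` remains reversible.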

The most basic simulation result of general circuits by reversible circuits states: 

Proposition 1 If a boolean function $f : \{0,1\}^n \to \{0,1\}$ can be computed by a circuit of size $s$ and
depth $d$, then it can be implemented by reversible circuits of size $O(s)$ and depth $O(d)$.

More advanced simulation results are also known, e.g. that the output on input $x0$ can be forced
to be $x \circ f(x) \circ 0$, and not just an arbitrary string starting with $f(x)$.

Noisy Circuits 

Normal boolean circuits are very sensitive to "hardware failures": if even a single gate or single
wire malfunctions then the computation may be completely wrong. If one worries about the physical
possibility of such failures, then it is desirable to design circuits that are more resilient. Much work
has been done on this topic, starting from von Neumann [?].

The usual models assume that each gate in the circuit can fail with some fixed probability p > 0, 
in which case its value is flipped or controlled by an adversary. Most upper bounds (as well as ours) 
work even for the case of an adversary, while most lower bounds (as well as ours) work even for the 
simpler case of random flips. The aim is to construct circuits that still compute, with high probability, 
the desired function, even when they are noisy. The probability of error achieved by the circuit (due 
to the noise) must be at most some fixed constant $\epsilon < 1/2$. The exact values of $p$ and $\epsilon$ turn out
not to matter beyond constant factors as long as $p$ is less than some threshold $p_0$ (which depends
on the computational basis), and $\epsilon$ is at least some threshold $\epsilon_0$ (which depends on $p$ as well as
on the computational basis). We say that such a circuit computes the function in a noise-resistant
way. The basic simulation results regarding noisy circuits state that any circuit of size $s$ and depth $d$
can be converted into a noise-resistant one of size poly($s$) and depth $O(d)$ which computes the same
function. (For lower bounds on the blow-up in depth and size see [?, ?].)

We will be considering a slightly different model in which the errors (noise) are not on the gates, 
but rather on the wires. We assume that each wire flips its value (or allows an adversary to control 
its value) with probability $p$, each "time unit". This means that we view the depth of a gate in the
circuit as corresponding to the "time" in which this gate has its latest input available (its output will
be available one time unit later). A wire that connects an output of a gate at depth $d$ to an input of a
gate of depth $d'$ will thus have $d' - d$ time units in which its value may be flipped (or controlled by
an adversary), with probability $p$ each time unit. We call such circuits noisy circuits.
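
To make the per-time-unit noise concrete, the sketch below computes, for the random-flip case only (no adversary), the probability that a wire's value differs from its original value after $t$ time units. The closed form $(1 - (1-2p)^t)/2$ follows from the one-step recurrence $q_{t+1} = q_t(1-p) + (1-q_t)p$; the function names are our own and the code is only an illustration of the model described above.

```python
def flip_prob_iter(p, t):
    """Probability that a wire is flipped after t time units, by direct recurrence."""
    q = 0.0  # at time 0 the wire carries the correct value
    for _ in range(t):
        # stays wrong if not flipped, becomes wrong if it was right and is flipped
        q = q * (1 - p) + (1 - q) * p
    return q

def flip_prob_closed(p, t):
    """Closed form of the same quantity: (1 - (1 - 2p)^t) / 2."""
    return (1 - (1 - 2 * p) ** t) / 2

for t in (1, 5, 20):
    print(t, flip_prob_iter(0.1, t), flip_prob_closed(0.1, t))
```

The value approaches 1/2 geometrically in $t$, which is why long wires are costly in this model.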
This model does seem reasonable as a model of many scenarios of noisy computations. An obvious
example is a cellular automaton whose state progresses in time, and it does seem that each cell can
get corrupted each time unit. We invite the reader to consider her favorite physical computation
device and see whether its model of errors agrees with ours. In any case, it is easy to see that this
model is equivalent in power to the one with noise on the gates, and thus is as powerful as non-noisy
circuits [13].

Noisy Reversible Circuits 

The model under study in this paper is the model of reversible circuits, as defined above, when the 
wires are subject to noise, also as defined above. The combination of these two issues has not been, as 
far as we know, studied formally before, though it might seem natural that given that all operations 
are reversible, the effect of noise can not be corrected. Indeed, it turns out that this combination is 
more problematic, in terms of computation power, than each of the elements alone. 

We wish to emphasize that this "problematic" behavior appears only with the combination of the 
definitions we consider - which we feel are the interesting ones, in many cases. Specifically, it is not 
difficult to see that each of the following variant definitions of "noisy reversible" circuits turns out to 
be equivalent in power to normal circuits. 

• The noise is on the gates instead of on the wires. 

• The noise on each wire is constant instead of being constant per time unit. 

• The inputs to the circuit can be connected at an arbitrary "level" of the circuit (corresponding
to an arbitrary time), as opposed to only at "time 0".

• Constant inputs can be connected at arbitrary levels. 

• The reversible circuits may contain 1-to-1 functions with a different number of input and output
bits. 

We first show that noisy reversible circuits can simulate general ones, although with a price which 
is exponential in the depth. 

Theorem 1 If a boolean function f can be computed by a circuit of size s and depth d, then f can 
be computed by a reversible noisy circuit of size $O(s \cdot 2^{O(d)})$ and depth $O(d)$.

Our main theorem shows that this exponential blow-up is unavoidable. It turns out that
noisy reversible circuits must have size which is exponential in the depth in order to do anything 
useful. 

Definition 3 We say that a noisy reversible circuit is worthless if on every fixed input, its first output 
bit takes both values (0 and 1) with probability of at least 49/100 each. 

This means that a worthless circuit simply outputs random noise on its first output bit, whatever 
the input is. 

Theorem 2 For any noisy reversible circuit of size $s$ and depth $d$ which is not worthless, $s = 2^{\Omega(d)}$.

This gives a full characterization of polynomial size noisy reversible circuits.

Corollary 1 Polynomial size noisy reversible circuits have exactly the power of (non-uniform) $NC^1$.

Note that for the lower bounds we do not assume anything on the fan-in of the gates: In fact, the
lower bound still holds even if the gates may operate on all the qubits together.
Quantum Circuits

We extend the upper bound, and a weaker version of the lower bound, to quantum circuits, which 
are the quantum generalization of reversible circuits. We will be using the model of quantum circuits 
with mixed states, suggested in [?].

The computation is performed by letting a system of n quantum bits, or "qubits" , develop in time. 
The state of these n qubits is a vector in the complex Hilbert space $\mathcal{H}_2^n$, generated by the vectors
$|0\rangle, |1\rangle, \ldots, |2^n - 1\rangle$, where the numbers $i$ are written in binary representation. This vector space is
viewed as a tensor product of n copies of $\mathcal{H}_2$, each corresponding to one of the qubits. The initial state
is one of the basic states, which corresponds to the input string. This state develops in time according
to the gates in the circuit. A quantum gate of order $k$ is a unitary matrix operating in the Hilbert
space $\mathcal{H}_2^k$ of $k$ qubits. Our circuits will be composed of quantum gates which belong to some fixed
set of gates (our computation basis). As is usual in boolean circuits the exact choice of a base is not
important as long as it is finite and "universal" [?, ?]. Keeping the number of qubits constant in time,
we do not allow wires in the circuit to "split", i.e. the fanout of each output wire (or input bit) in the
circuit is always 1. It follows that a quantum circuit has N inputs and N outputs, for some N. The
function that the quantum circuit computes is defined as the result of a "measurement" of the first
qubit: i.e. some kind of projection of the final state on a subspace of $\mathcal{H}_2^n$. In the model of quantum
circuits with mixed states, we allow the n qubits to be in some probability distribution over vectors in
the Hilbert space, and such a general (mixed) state is best described by the physical notion of density
matrices.

Noisy Quantum Circuits 

As in the classical case, we consider the model of noise on the wires. We assume that each wire 
(qubit) allows an adversary to control its "value" with probability $p$, each "time unit". The definition
of quantum noise is more subtle than that of classical noise since the "value" of a qubit is not always
defined. Instead, what we mean by "controlling the qubit" is the following operation: An arbitrary
operation on the "controlled" qubit and the state of the environment, represented by $m$ qubits in
some arbitrary state, is applied, after which the state of the environment is averaged upon, to give
the (reduced) density matrix of the $n$ qubits of the circuit. This type of damage on a qubit occurs
with probability p each time step for each qubit. The computation is composed alternately of noise 
steps and computation steps. We call such quantum circuits noisy quantum circuits. 

We first show the quantum analog of Theorem 2, i.e. that noisy quantum circuits must have size
which is exponential in the depth in order to do anything useful. 

Theorem 3 For any noisy quantum circuit of size $s$ and depth $d$ which is not worthless, $s = 2^{\Omega(d)}$.

Where, as for reversible circuits, we say that a noisy quantum circuit is worthless if on every fixed 
input, its first output bit takes both values (0 and 1) with probability of at least 49/100 each. 

We next give a lower bound on the power of noisy quantum circuits: We show that noisy quantum 
circuits can simulate general quantum circuits, with an exponential cost. 

Theorem 4 If a boolean function $f$ can be computed by a quantum circuit of size $s$ and depth $d$,
then $f$ can be computed by a noisy quantum circuit of size $O(s \cdot \mathrm{polylog}(s)) \cdot 2^{O(d \cdot \mathrm{polylog}(d))}$ and depth
$O(d \cdot \mathrm{polylog}(d))$.

This gives a characterization of polynomial size noisy quantum circuits. 

Corollary 2 Polynomial size noisy quantum circuits are not stronger than quantum circuits with
$O(\log(n))$ depth (the class $QNC^1$). On the other hand, quasi-polynomial noisy quantum circuits can
compute any function in $QNC^1$.
For the lower bound we use the results in [?], in which, using noisy quantum circuits which allow
qubits to be initialized at different times, it is shown how to make the circuit noise-resistant with
polylogarithmic blow-up in the depth. The bounds in the quantum case are not tight because,
unlike for classical circuits [?], it is not yet known whether quantum noise
resistance can be achieved with constant blow-up in the depth.

We emphasize again that these results are very specific to the model we defined, and as in the
case of noisy reversible (classical) circuits, a slight change in the definitions dramatically changes the
computational power, and variant definitions of "noisy quantum circuits" turn out to be equivalent in
power to normal quantum circuits, due to the results in [?, ?].

2 Noisy reversible circuits - The Upper Bound 

In this section we prove the upper bound, meaning that a noisy reversible circuit can simulate any 
circuit with exponential cost. 

Theorem 5 If a boolean function f can be computed by a boolean circuit of size s and depth d, then 
$f$ can be computed by a noisy reversible circuit of size $O(s \cdot 2^{O(d)})$ and depth $O(d)$.

Proof: By [?], we can convert the circuit that computes $f$ to a reversible circuit, $R$, which has linear
depth and polynomial size. We now want to convert $R$ to $C$, a noise-resistant reversible circuit which
computes $f$ with high probability. Note that the majority function can be implemented reversibly,
by a three bit to three bit gate, of which the first output is the majority of the three inputs. Also,
this reversible function on three bits can be implemented by a constant number of gates from the
universal set being used, where these gates will operate on the three bits plus a constant number of
extra bits, which will all be output in the state in which they were input. To construct $C$, replace each bit
in $R$ by $3^d$ bits. Each time step we will limit the computation to a third of the bits, which will be
"good". In the $i$'th time step we will operate on $3^{d-i}$ good bits. This is done as follows: A gate in the
$i$'th time step in $R$ is transformed in $C$ to $3^{d-i}$ copies of the same gate, applied bitwise on the $3^{d-i}$
good bits. We then divide these bits to triples, apply reversible majority gates on each triple, and
limit ourselves to the $3^{d-i-1}$ results of these gates, which will be the good bits, on which we operate
bitwise the next time step in $R$, and so on. The claim is that if $p$ is small enough, the probability for a
"good" bit at time step $i$ to err is less than $p$. The proof is by induction on $i$: Let the input bits for
the $i$'th step have error with probability $< p$. Another noise step makes this probability $< 2p$. After
the computation step, if the fan-in of the gates is $\le k$, then the error probability for each output is
$< 2kp$. After another noise step, the error probability is $< (2k+1)p$. Now apply the majority gate.
Note that the error probabilities for each one of the inputs to the majority gates are independent.
Hence the error probability for the result of the majority gate is less than $3((2k+1)p)^2 + ((2k+1)p)^3$,
which is $< p$ if $p$ is smaller than some constant threshold. ∎
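
The induction step of the proof can be checked numerically. The sketch below evaluates the bound $3((2k+1)p)^2 + ((2k+1)p)^3$ and tests whether it stays below $p$ for a given fan-in $k$; it also reports the $3^d$ redundancy per bit. The helper names and the sample values of $p$ and $k$ are assumptions chosen for illustration, and the code says nothing about the optimal threshold.

```python
def majority_error(q):
    """Bound on P(at least 2 of 3 independent inputs are wrong), each wrong w.p. < q."""
    return 3 * q ** 2 + q ** 3

def step_error(p, k):
    """Error bound for a good bit after noise, a fan-in-k gate, noise, then majority."""
    return majority_error((2 * k + 1) * p)

def below_threshold(p, k):
    """True if one simulated step keeps the error probability of a good bit below p."""
    return step_error(p, k) < p

def bits_per_wire(d):
    """Number of bits of C that replace a single bit of R in a depth-d circuit."""
    return 3 ** d

for p in (0.001, 0.005, 0.05):
    print(p, step_error(p, k=3), below_threshold(p, k=3))
print(bits_per_wire(10))
```

With fan-in 3 (as for the Toffoli gate) the check passes for small $p$ and fails for larger $p$, in line with the existence of a constant threshold.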

3 Noisy Reversible Circuits - The Lower Bound 

In this section we prove the lower bound, meaning that after $O(\log(n))$ steps of computation there
is only a polynomially small amount of information in the system. We first show that each step of faults
reduces the information in the system by a constant factor which depends only on the fault probability.



Lemma 1 Let $X$ be a string of $n$ bits, which is a random variable. Let $Y$ be the string of $n$ bits
generated from $X$ by flipping each bit with independent probability $p$. Then $I(Y) \le (1-2p)^2 I(X)$,
where $I$ is the Shannon information.

Proof: Let us first prove this for $n = 1$. Let $\alpha, \beta$ be the probabilities that the bits $X, Y$ are 1, respectively.
Let $\alpha = 1/2 + \delta/2$. Then

$$\beta = (1-p)\alpha + p(1-\alpha) = 1/2 + \delta\rho/2,$$

where $\rho = 1 - 2p$, $|\rho| < 1$. Then $I(X)$, $I(Y)$ are functions of $\delta$ and $\rho$:

$$I(X) = 1 + \alpha\log(\alpha) + (1-\alpha)\log(1-\alpha) = K(\delta) = \big((1+\delta)\log(1+\delta) + (1-\delta)\log(1-\delta)\big)/2.$$

Developing $K(\delta)$ to a power series we get

$$K(\delta) = (1/\ln(2)) \sum_{k=1}^{\infty} \delta^{2k}/[2k(2k-1)],$$

converging for all $0 \le \delta \le 1$. Therefore

$$I(Y) = K(\delta\rho) = (1/\ln(2)) \sum_{k=1}^{\infty} (\delta\rho)^{2k}/[2k(2k-1)] \le (1/\ln(2))\,\rho^2 \sum_{k=1}^{\infty} \delta^{2k}/[2k(2k-1)] = \rho^2 K(\delta) = \rho^2 I(X),$$

proving that $I(Y) \le (1-2p)^2 I(X)$. We now use this result to prove the lemma for general $n$. Let us write the
strings $X, Y$ as $X_1, X_2, \ldots, X_n$ and $Y_1, Y_2, \ldots, Y_n$, where $X_i$ and $Y_i$ are random variables that take the
value 0 or 1. Then

$$I(Y) = \sum_{i=1}^{n} I(Y_i | Y_{i+1}, \ldots, Y_n) \le \sum_{i=1}^{n} I(Y_i | X_{i+1}, \ldots, X_n),$$

where we used the fact that $I(A|C) \le I(A|B)$ where $A, B, C$ are random variables, and $C$ is a function
of $B$, where the function might also be a random variable, independent of $A$ and $B$. Using the following
known formula: $I(A|B) = \sum_b \Pr(B = b) I(A|b)$, we can write the last term as

$$\sum_{i=1}^{n} \sum_{x_{i+1}, \ldots, x_n} \Pr(X_{i+1} = x_{i+1}, \ldots, X_n = x_n)\, I(Y_i | x_{i+1}, \ldots, x_n)$$

$$\le \sum_{i=1}^{n} \sum_{x_{i+1}, \ldots, x_n} \Pr(X_{i+1} = x_{i+1}, \ldots, X_n = x_n)\, (1-2p)^2 I(X_i | x_{i+1}, \ldots, x_n) = (1-2p)^2 I(X),$$

where we have used the proof for one variable. ∎
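
The single-bit case of Lemma 1 is easy to sanity-check numerically. The sketch below evaluates $K(\delta)$ directly from its closed form and verifies $K(\delta\rho) \le \rho^2 K(\delta)$ on a grid of values; the grid, the tolerance and the function names are our own illustrative choices, and a numerical check is of course no substitute for the proof.

```python
from math import log2

def K(delta):
    """Information 1 - H((1 + delta) / 2) of a bit with bias delta, in bits."""
    total = 0.0
    for x in (1 + delta, 1 - delta):
        if x > 0:  # convention: 0 * log(0) = 0
            total += x * log2(x)
    return total / 2

def lemma_holds(delta, p, tol=1e-12):
    """Check I(Y) = K(delta * rho) <= rho^2 * K(delta) with rho = 1 - 2p."""
    rho = 1 - 2 * p
    return K(delta * rho) <= rho ** 2 * K(delta) + tol

grid_ok = all(
    lemma_holds(d / 100, q / 100)
    for d in range(0, 101)
    for q in range(1, 50)
)
print(grid_ok)
```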
We can now prove the main theorem: 

Theorem 2: For any noisy reversible circuit of size $s$ and depth $d$ which is not worthless, $s = 2^{\Omega(d)}$.

Proof: We first note that since each level of computation is reversible, the entropy does not
change due to the computation step, and since the number of bits is constant, the information does
not change either during a computation step. We start with information $n$, and it decreases at a rate
which is exponential in the number of noise steps: After $m$ steps the information in the system is less
than $(1-2p)^{2m} n$, by Lemma 1. When $m = O(\log(n))$ the information is polynomially small. The
information on any bit is smaller than the information on all the bits. ∎
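
To see how quickly the bound $(1-2p)^{2m} n$ shrinks, the following sketch computes it and finds the first number of noise steps at which it drops below a chosen level. The parameter `eps` and the function names are assumptions made for this illustration; the paper itself only uses the asymptotic statement.

```python
def info_bound(n, p, m):
    """Lemma 1 applied m times: at most (1 - 2p)^(2m) * n bits of information remain."""
    return (1 - 2 * p) ** (2 * m) * n

def steps_until_below(n, p, eps):
    """Smallest m for which the bound (1 - 2p)^(2m) * n is at most eps."""
    m = 0
    while info_bound(n, p, m) > eps:
        m += 1
    return m

for n in (2 ** 10, 2 ** 20, 2 ** 40):
    print(n, steps_until_below(n, p=0.1, eps=0.01))
```

The number of steps grows only logarithmically in $n$, which is the reason a circuit that is not worthless must have size exponential in its depth.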



4 Quantum Computation 

In this section we recall the definitions of quantum circuits [?, ?, ?] with mixed states [?], quantum
noise, and quantum entropy [13].
4.0.1 Pure states



We deal with systems of $n$ two-state quantum particles, or "qubits". The pure state of such a system
is a unit vector, denoted $|\alpha\rangle$, in the Hilbert space¹ $\mathbb{C}^{2^n}$, i.e. a $2^n$ dimensional complex space. We view
$\mathbb{C}^{2^n}$ as a tensor product of $n$ two dimensional spaces, each corresponding to a qubit: $\mathbb{C}^{2^n} = \mathbb{C}^2 \otimes \cdots \otimes \mathbb{C}^2$.
As a basis for $\mathbb{C}^{2^n}$, we use the $2^n$ orthogonal basic states: $|i\rangle = |i_1\rangle \otimes |i_2\rangle \otimes \cdots \otimes |i_n\rangle$, $0 \le i < 2^n$, where
$i$ is in binary representation, and each $i_j$ gets 0 or 1. Such a state corresponds to the $j$'th qubit being
in the state $|i_j\rangle$. A pure state $|\alpha\rangle \in \mathbb{C}^{2^n}$ is a superposition of the basic states: $|\alpha\rangle = \sum_{i=1}^{2^n} c_i |i\rangle$,
with $\sum_{i=1}^{2^n} |c_i|^2 = 1$. $|\alpha\rangle$ corresponds to the vector $v_\alpha = (c_1, c_2, \ldots, c_{2^n})$. $v_\alpha^\dagger$, the complex conjugate
of $v_\alpha$, is denoted $\langle\alpha|$. The inner product between $|\alpha\rangle$ and $|\beta\rangle$ is $\langle\alpha|\beta\rangle = (v_\alpha, v_\beta^\dagger)$. The matrix $v_\alpha^\dagger v_\beta$
is denoted as $|\alpha\rangle\langle\beta|$. An isolated system of $n$ qubits develops in time by a unitary matrix² of size
$2^n \times 2^n$: $|\alpha(t_2)\rangle = U|\alpha(t_1)\rangle$. A quantum system in $\mathbb{C}^{2^n}$ can be observed by measuring the system. An
important measurement is a basic measurement of a qubit $q$, of which the possible outcomes are 0, 1.
For the state $|\alpha\rangle = \sum_{i=1}^{2^n} c_i |i\rangle$, the probability for outcome 0 is $p = \sum_{i,\, i|_q = 0} |c_i|^2$ and the state of the
system will collapse to $|\beta\rangle = \frac{1}{\sqrt{p}} \sum_{i,\, i|_q = 0} c_i |i\rangle$ (the same for 1). In general, an observable $O$ over $\mathbb{C}^{2^n}$ is
an hermitian³ matrix, of size $2^n \times 2^n$. To apply a measurement of $O$ on a pure state $|\alpha\rangle \in \mathbb{C}^{2^n}$, write
$|\alpha\rangle$ uniquely as a superposition of unit eigenvectors of $O$: $|\alpha\rangle = \sum_i c_i |o_i\rangle$, where the $|o_i\rangle$ have different
eigenvalues. With probability $|c_i|^2$ the measurement's outcome will be the eigenvalue of $|o_i\rangle$, and the
state will collapse to $|o_i\rangle$. A unitary operation $U$ on $k$ qubits can be applied on $n$ qubits, $n \ge k$,
by taking the extension $\tilde{U}$ of $U$, i.e. the tensor product of $U$ with an identity matrix on the other
qubits. The same applies for an observable $O$ to give $\tilde{O}$.

4.0.2 Mixed states 

A system which is not ideally isolated from its environment is described by a mixed state. There are
two equivalent descriptions of mixed states: mixtures and density matrices. We use density matrices
in this paper. A system in the mixture $\{\alpha\} = \{p_k, |\alpha_k\rangle\}$ is with probability $p_k$ in the pure state
$|\alpha_k\rangle$. The rules of development in time and measurements for mixtures are obtained by applying
classical probability to the rules for pure states. A density matrix $\rho$ on $\mathbb{C}^{2^n}$ is an hermitian positive
semi-definite complex matrix of dimensions $2^n \times 2^n$, with $\mathrm{tr}(\rho) = 1$. A pure state $|\alpha\rangle = \sum_i c_i |i\rangle$
is associated with the density matrix $\rho_{|\alpha\rangle} = |\alpha\rangle\langle\alpha|$, i.e. $\rho_{|\alpha\rangle}(i,j) = c_i c_j^*$. A mixture $\{\alpha\} = \{p_l, |\alpha_l\rangle\}$ is
associated with the density matrix $\rho_{\{\alpha\}} = \sum_l p_l \rho_{|\alpha_l\rangle}$. The operations on a density matrix are defined
such that the correspondence to mixtures is preserved. If a unitary matrix $U$ transforms the mixture
$\{\alpha\} = \{p_l, |\alpha_l\rangle\}$ to $\{\beta\} = \{p_l, U|\alpha_l\rangle\}$, then $\rho_{\{\beta\}} = \sum_l p_l U|\alpha_l\rangle\langle\alpha_l|U^\dagger = U\rho_{\{\alpha\}}U^\dagger$. Let $\rho$ be written
in a basis of eigenvectors $v_i$ of an observable $O$. A measurement of $O$ on $\rho$ gives the outcome $\lambda$
with the probability which is the sum of the diagonal terms of $\rho$ which relate to the eigenvalue $\lambda$:
$\Pr(\lambda) = \sum_{i=1}^{2^n} \rho_{v_i, v_i}\, \delta(\lambda_i = \lambda)$. Conditioned that the outcome is the eigenvalue $\lambda$, the resulting density
matrix is $O_\lambda \circ (\rho)$, which we get by first putting to zero all rows and columns in $\rho$ which relate to
eigenvalues different from $\lambda$, and then renormalizing this matrix to trace one. Without conditioning
on the outcome the resulting density matrix will be $O \circ (\rho) = \sum_k \Pr(\lambda_k)\, O_{\lambda_k} \circ (\rho)$, which differs from
$\rho$ only in that the entries in $\rho$ which connect between different eigenvalues are put to zero. Given a
density matrix $\rho$ of $n$ qubits, the reduced density matrix of a subsystem, $A$, of, say, $m$ qubits is defined
as an average over the states of the other qubits: $\rho|_A(i,j) = \sum_{k=1}^{2^{n-m}} \rho(ik, jk)$.

4.1 Quantum circuits with mixed states 

A quantum unitary gate of order $k$ is a complex unitary matrix of size $2^k \times 2^k$. A density matrix
$\rho$ will transform by the gate to $g \circ \rho = \tilde{U} \rho \tilde{U}^\dagger$, where $\tilde{U}$ is the extension of $U$. A measurement gate of
order $k$ is a complex hermitian matrix of size $2^k \times 2^k$. A density matrix $\rho$ will transform by the gate
to $g \circ \rho = \tilde{O} \circ (\rho)$. A quantum circuit is a directed acyclic graph with $n$ inputs and $n$ outputs. Each
node $v$ in the graph is labeled by a quantum gate $g_v$. The in-degree and out-degree of $v$ are equal to
the order of $g_v$. Some of the outputs are labeled "result" to indicate that these are the qubits that
will give the output of the circuit. The wires in the circuit correspond to qubits. An initial density
matrix $\rho$ transforms by a circuit $Q$ to a final density matrix $Q \circ \rho = g_t \circ \cdots \circ g_2 \circ g_1 \circ \rho$, where the
gates $g_t, \ldots, g_1$ are applied in a topological order. For an input string $i$, the initial density matrix is
$\rho_{|i\rangle}$. The output of the circuit is the outcome of applying basic measurements of the result qubits
on the final density matrix $Q \circ \rho_{|i\rangle}$. Since the outcomes of measurements are random, the function
that the circuit computes is a probabilistic function, i.e. for input $i$ it outputs strings according to a
distribution which depends on $i$.

1 A Hilbert space is a vector space with an inner product.

2 Unitary matrices preserve the norm of any vector and satisfy the condition $U^{-1} = U^\dagger$.
3 An hermitian matrix $H$ satisfies $H = H^\dagger$.

4.2 Noisy Quantum Circuits 

As any physical system, a quantum system is subjected to noise. The process of errors depends on 
time, so the quantum circuit will be divided to levels, or time steps. In this model (as in [?], but not
as in [?]), all qubits are present at time 0. The model of noise we use for noisy quantum circuits is
a single qubit noise, in which a qubit is damaged with probability $1/2 > p > 0$ each time step. The
damage operates as follows: A unitary operation operates on the qubit and a state of the environment 
(The environment can be represented by m qubits in some state) . This operation results in a density 
matrix of the n qubits of the system and the environment. We reduce this density matrix to the n 
qubits of the circuit to get the new density matrix after the damage. The density matrix of the circuit 
develops by applying alternately the computation step and this probabilistic process of noise. The 
function computed by the noisy quantum circuit is naturally the average over the outputs, on the 
probabilistic process of noise. 

4.3 Quantum Entropy 

In this subsection we give some background about the notion of quantum entropy and its relation
to Shannon information. All definitions and lemmas can be found in the book of Asher Peres [12].

Definition 1 The (von Neumann) entropy of a density matrix $\rho$ is defined to be $S(\rho) = -\mathrm{Tr}(\rho \log_2(\rho))$.

Definition 2 The information in a density matrix $\rho$ of $n$ qubits is defined to be $I(\rho) = n - S(\rho)$.
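
For a single qubit these two definitions can be evaluated in closed form, since a $2 \times 2$ density matrix with diagonal $(a, 1-a)$ and off-diagonal entry $b$ has eigenvalues $(1 \pm \sqrt{(2a-1)^2 + 4|b|^2})/2$. The sketch below is restricted to $n = 1$ and uses our own parameterization and function names; it is meant only to illustrate that a pure state carries one bit of information while the maximally mixed state carries none.

```python
from math import log2, sqrt

def qubit_eigenvalues(a, b=0.0):
    """Eigenvalues of the one-qubit density matrix [[a, b], [conj(b), 1 - a]]."""
    r = sqrt((2 * a - 1) ** 2 + 4 * abs(b) ** 2)
    return ((1 + r) / 2, (1 - r) / 2)

def entropy(a, b=0.0):
    """von Neumann entropy S(rho) = -Tr(rho log2 rho), via the eigenvalues of rho."""
    # eigenvalues that are (numerically) zero contribute nothing: 0 * log(0) = 0
    return -sum(x * log2(x) for x in qubit_eigenvalues(a, b) if x > 1e-15)

def information(a, b=0.0):
    """I(rho) = n - S(rho) for n = 1 qubit."""
    return 1 - entropy(a, b)

print(information(1.0))        # the basic state |0><0|
print(information(0.5, 0.5))   # the pure state (|0> + |1>)/sqrt(2)
print(information(0.5))        # the maximally mixed state
```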

The Shannon entropy, $H$, in the distribution over the results of any measurement on $\rho$ is larger
than the von Neumann entropy in $\rho$. This means that one cannot extract more Shannon information
from $\rho$ than the von Neumann information.

Lemma 2 Let $O$ be an observable of $n$ qubits, $\rho$ a density matrix of $n$ qubits. Let $f$ be the distribution
which $\rho$ induces on the eigenvalues of $O$. Then $H(f) \ge S(\rho)$.

Like the Shannon entropy, the von Neumann entropy is concave:

Lemma 3 Let $\rho_i$ be density matrices of the same number of qubits. Let $p_i$ be some distribution on
these matrices. Then $S(\sum_i p_i \rho_i) \ge \sum_i p_i S(\rho_i)$.

The Shannon entropy of two independent variables is just the sum of the entropies of each one. 
The quantum analog is that the von Neumann entropy in a system which consists of non-entangled
subsystems is just the sum of the entropies:

Lemma 4 $S(\rho_1) + S(\rho_2) = S(\rho_1 \otimes \rho_2)$.

One can define a relative von Neumann entropy:
Definition 3 Let $\rho_1, \rho_2$ be two density matrices of the same number of qubits. The relative entropy
of $\rho_1$ with respect to $\rho_2$ is defined as $S(\rho_1|\rho_2) = \mathrm{Tr}[\rho_2(\log_2(\rho_2) - \log_2(\rho_1))]$.

The relative entropy is a non-negative quantity:

Lemma 5 Let $\rho_1, \rho_2$ be two density matrices of the same number of qubits. The relative entropy of
$\rho_1$ with respect to $\rho_2$ is non-negative: $S(\rho_1|\rho_2) \ge 0$.

5 The Quantum Upper Bound

In this section we prove the upper bound for the case of noisy quantum circuits: a noisy quantum 
circuit can simulate any quantum circuit with exponential cost. 

Theorem 4: If a boolean function $f$ can be computed by a quantum circuit of size $s$ and depth $d$,
then $f$ can be computed by a noisy quantum circuit of size $O(s \cdot \mathrm{polylog}(s)) \cdot 2^{O(d \cdot \mathrm{polylog}(d))}$ and depth
$O(d \cdot \mathrm{polylog}(d))$.

Proof: In [?] it is shown that any quantum circuit $Q$, with depth $d$ and size $s$, can be simulated
by a noisy quantum circuit $\tilde{Q}$, whose depth and size exceed $d$ and $s$ by polylogarithmic factors, where
a different model of noisy quantum circuit is used: qubits are allowed to be initialized at any time
during the computation. To adapt the proof to our model, in which all qubits are present at time 0, it
will suffice to show that there exists a noisy quantum circuit, $A_t$, with depth $t$ operating on $3^t$ qubits,
such that if the error probability is $p$, for an input string of all zeroes, at time $t$ the first qubit is in
the state $|0\rangle$ with probability $> (1-p)$. If such $A_t$ exists, then for each qubit, $q$, which is input to
$\tilde{Q}$ at time $t$, we simply add $3^t - 1$ qubits to the circuit, and together with $q$ they will be initialized
at $t = 0$ to be $|0\rangle$. On these $3^t$ qubits we will operate the sequence of gates $A_t$, and the first qubit
will play the role of $q$ after time $t$. The new circuit is a noisy quantum circuit for which all qubits are
initialized at time 0; its size is $O(s \cdot \mathrm{polylog}(s)) \cdot 2^{O(d \cdot \mathrm{polylog}(d))}$ and its depth is $O(d \cdot \mathrm{polylog}(d))$.

$A_t$ is constructed as follows: We begin with $3^t$ qubits in the state $|0\rangle$. These qubits can be divided
to triples of qubits. We apply the following "majority" quantum gate on each triple:

$|000\rangle \mapsto |000\rangle$ , $|100\rangle \mapsto |011\rangle$

$|001\rangle \mapsto |001\rangle$ , $|101\rangle \mapsto |101\rangle$

$|010\rangle \mapsto |010\rangle$ , $|110\rangle \mapsto |110\rangle$

$|011\rangle \mapsto |100\rangle$ , $|111\rangle \mapsto |111\rangle$.

The first qubit of each triple now carries the result of the majority. (Note that the function of majority
works here, in the quantum case, because we only need to deal with non-entangled states: the zero
$|0\rangle$ state. The majority gate is not a good method to pick the majority out of three general pure
states.) All these $3^{t-1}$ result qubits can now be divided also to triples, and we apply majority gates on
these triples, and so on, until time $t$. We claim that the error probability of the result qubits of time
step $i \le t$ is $< p$, if $p$ is smaller than some threshold. Let us prove this by induction on $i$. If each qubit
in the $i$'th time step has error probability $< p$, then after one noise step its error probability is $< 2p$.
The majority gate is applied on qubits with independent error probabilities. Thus the probability for
the majority result to err is less than $3(2p)^2 + (2p)^3$. This probability is smaller than $p$ if $p$ is small
enough. ∎
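
The truth table of the majority gate above can be checked mechanically on basis states. The sketch below encodes the table as a Python dictionary (the name `MAJ` and the helper functions are our own), verifies that it is a permutation of the eight basis states with the majority on the first output bit, and evaluates the induction bound $3(2p)^2 + (2p)^3$ for two sample values of $p$.

```python
# Action of the "majority" gate on the eight basis states of a triple of qubits.
MAJ = {
    "000": "000", "100": "011",
    "001": "001", "101": "101",
    "010": "010", "110": "110",
    "011": "100", "111": "111",
}

def is_permutation(table):
    """A classical reversible gate must map basis states one-to-one onto basis states."""
    return sorted(table.values()) == sorted(table.keys())

def first_bit_is_majority(table):
    """The first output bit must equal the majority of the three input bits."""
    return all(
        out[0] == ("1" if inp.count("1") >= 2 else "0")
        for inp, out in table.items()
    )

def induction_step_ok(p):
    """True if 3(2p)^2 + (2p)^3 < p, i.e. one level of A_t keeps the error below p."""
    return 3 * (2 * p) ** 2 + (2 * p) ** 3 < p

print(is_permutation(MAJ), first_bit_is_majority(MAJ))
print(induction_step_ok(0.05), induction_step_ok(0.1))
```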

6 The Quantum Lower Bound 

We prove that in a noisy quantum circuit the information decreases exponentially in the number 
of time steps, in the presence of the following type of quantum noise, where a qubit that undergoes a 
fault is replaced by a qubit in one of the basic states, which is chosen randomly. Such a qubit carries
no information. We first show that a noise step causes the information in the circuit to decrease at
least by a constant factor $(1-p)$, and then show that quantum gates can only decrease the information
in the system. This is true even for the more general model of quantum non-reversible circuits, where 
measurement gates are allowed. 

Let us first show that quantum gates can only reduce information: 

Lemma 6 Let g be a quantum gate and $\rho$ a density matrix. Then $I(g \circ \rho) \le I(\rho)$.

Proof: For a unitary gate, which is reversible, $I(\rho) = I(g \circ \rho)$. The proof for the case of a
measurement gate is given in the appendix. ∎

In order to show that during a noise step the information decreases by a factor, we need the
following lemma (which is the quantum analog of a theorem proved in []): the average information
in k qubits chosen randomly out of n qubits is at most $k/n$ times the information in all the n qubits.

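Lemma 7 Let $\rho$ be a density matrix of n qubits, and let k < n. Then $\frac{1}{\binom{n}{k}} \sum_{A_k} I(\rho|_{A_k}) \le \frac{k}{n} I(\rho)$,
where the sum is over all subsets $A_k$ of k qubits.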
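
The special case of diagonal density matrices, which are just classical probability distributions over n-bit strings, can be checked numerically with the standard library alone. The sketch below does this, taking the information of a distribution on m bits to be m minus its Shannon entropy (the classical counterpart of $I(\rho) = n - S(\rho)$). All function names are illustrative, and a numerical check on one random distribution is of course not a proof.

```python
import itertools
import math
import random


def entropy(dist: dict) -> float:
    """Shannon entropy (base 2) of a distribution given as {outcome: prob}."""
    return -sum(q * math.log2(q) for q in dist.values() if q > 0)


def marginal(dist: dict, subset: tuple) -> dict:
    """Marginal distribution on the bit positions listed in `subset`."""
    out = {}
    for bits, q in dist.items():
        key = tuple(bits[i] for i in subset)
        out[key] = out.get(key, 0.0) + q
    return out


def information(dist: dict, num_bits: int) -> float:
    """Classical analog of I = n - S: number of bits minus entropy."""
    return num_bits - entropy(dist)


def average_subset_information(dist: dict, n: int, k: int) -> float:
    """Average of the information over all k-element subsets of n bits."""
    subsets = list(itertools.combinations(range(n), k))
    total = sum(information(marginal(dist, s), k) for s in subsets)
    return total / len(subsets)


def random_distribution(n: int, rng: random.Random) -> dict:
    """A random probability distribution over all n-bit strings."""
    weights = [rng.random() for _ in range(2 ** n)]
    norm = sum(weights)
    outcomes = itertools.product((0, 1), repeat=n)
    return {bits: w / norm for bits, w in zip(outcomes, weights)}
```

For example, with `dist = random_distribution(4, random.Random(0))`, comparing `average_subset_information(dist, 4, k)` with `k / 4 * information(dist, 4)` for each k illustrates the inequality of the lemma in this classical setting.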

We can now prove that for a specific type of quantum noise, in which each qubit is replaced with
probability p by a qubit in a random state, the information in the quantum circuit decreases by a
factor of $(1-p)$ after each noise step:

Lemma 8 Let $\rho$ be a density matrix of n qubits. Let each qubit in $\rho$ be replaced, independently with
probability p, by a qubit in the density matrix $\rho_R = \frac{1}{2}(|0\rangle\langle 0| + |1\rangle\langle 1|)$, to give the density matrix $\sigma$.
Then $I(\sigma) \le (1-p) I(\rho)$.

Proof:

Let us write $\sigma = \sum_{k=0}^{n} \sum_{A_k} p^{n-k}(1-p)^k \, \rho|_{A_k} \otimes \rho_R^{n-k}$, where the sum over $A_k$ is a sum over all
subsets of k qubits, and the power on the density matrix means taking $n-k$ times the tensor product
of $\rho_R$. This presents the resulting density matrix as a probability distribution over all possible cases
where the faults could have occurred, with the correct probabilities. By the concavity of the entropy
we have $I(\sigma) \le \sum_{k=0}^{n} p^{n-k}(1-p)^k \sum_{A_k} [I(\rho|_{A_k}) + (n-k) I(\rho_R)]$, where we have used the additivity
of the entropy under tensor products (proved in the appendix). Since $I(\rho_R) = 0$, we have that

$$I(\sigma) \le \sum_{k=0}^{n} p^{n-k}(1-p)^k \sum_{A_k} I(\rho|_{A_k}). \qquad (1)$$

Using Lemma 7, we get $I(\sigma) \le \sum_{k=0}^{n} p^{n-k}(1-p)^k \frac{k}{n} \binom{n}{k} I(\rho) = (1-p) I(\rho)$. ∎

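
The last equality in the proof rests on the binomial identity $\sum_{k=0}^{n} \binom{n}{k} p^{n-k}(1-p)^k \frac{k}{n} = 1-p$, which is the mean of a binomial distribution with success probability $1-p$, divided by n. A minimal numerical check, with an illustrative function name:

```python
from math import comb


def noise_step_factor(n: int, p: float) -> float:
    """Sum over k of C(n,k) p^(n-k) (1-p)^k (k/n); should equal 1 - p."""
    return sum(
        comb(n, k) * p ** (n - k) * (1 - p) ** k * k / n
        for k in range(n + 1)
    )
```

Here k counts the qubits that survive the noise step, each surviving independently with probability $1-p$.
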
We can now prove the lower bound on noisy quantum circuits: 

Theorem ^ For any noisy quantum circuit of size s and depth d which is not worthless, s = . 

Proof: Using Lemmas 6 and 8 we can show by induction on t that after t time steps the information
in the system satisfies $I(\rho) \le (1-p)^t s$, so the information in the final density matrix is at most $(1-p)^d s$. The
classical information in the probability distribution which we get when measuring the result qubits in
$\rho$ is at most $I(\rho)$, due to the first lemma of the appendix and the fact that the basic measurements of the r result qubits
can be replaced by one observable on the result qubits, with each possible string $|i\rangle$ as an eigenvector
with eigenvalue i. ∎
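
The inequality used in the proof, that the final information is at most $(1-p)^d s$, can be turned into a rough size estimate. The sketch below assumes (this is an assumption made for illustration, not a definition taken from the text) that a circuit which is not worthless must retain at least some constant amount c of information at the end; under that assumption s must be at least $c (1-p)^{-d}$, which grows exponentially in d. The names `information_upper_bound` and `min_size` and the parameter `c` are hypothetical.

```python
import math


def information_upper_bound(s: int, d: int, p: float) -> float:
    """Bound (1-p)^d * s on the information left after d noisy time steps."""
    return (1 - p) ** d * s


def min_size(d: int, p: float, c: float = 1.0) -> int:
    """Smallest integer s with (1-p)^d * s >= c (c is an assumed constant)."""
    return math.ceil(c / (1 - p) ** d)
```

For instance, with p = 0.1 and c = 1, a depth of 50 already requires `min_size(50, 0.1)` to be in the hundreds, and each additional time step multiplies the requirement by $1/(1-p)$.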

7 Open Question 

We have shown that the power of noisy reversible circuits is exactly that of the complexity class $NC^1$. Is the
power of noisy quantum circuits exactly that of the quantum analog complexity class, $QNC^1$? Making
the lower bound tight connects to the following open question: Can noisy quantum circuits be made
noise resistant with only a constant blow-up in depth?

8 Appendix - Quantum Entropy Lemmas

In this appendix we give the proofs of lemmas regarding quantum entropy. All the proofs, except 
the last one which is a new result as far as we know, are taken from plj . 

Lemma: Let O be an observable of n qubits, $\rho$ a density matrix of n qubits. Let f be the
distribution which $\rho$ induces on the eigenvalues of O. Then $H(f) \ge S(\rho)$.

Proof: Let $\{v_i\}$ be the eigenvectors of $\rho$, with $p_i$ their eigenvalues, so that $S(\rho) = -\sum_i p_i \log(p_i)$. The
probability to get an eigenvalue $\lambda_j$ when measuring $\rho$ is $q_j = \sum_k p_k G_{kj}$, where $G_{kj}$ is the probability to
get $\lambda_j$ when measuring $v_k$. So $H(f) = -\sum_j q_j \log(q_j)$, and
$H(f) - S(\rho) = \sum_i p_i \log(p_i) - \sum_j q_j \log(q_j) = \sum_i p_i (\log(p_i) - \sum_j G_{ij} \log(q_j)) = \sum_{i,j} p_i G_{ij} \log(p_i/q_j)$,
where we have used $\sum_j G_{ij} = 1$. Now $\log x \ge 1 - \frac{1}{x}$, which gives
$H(f) - S(\rho) \ge \sum_{i,j} p_i G_{ij} (1 - q_j/p_i) = 0$. ∎
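
A single-qubit instance makes the lemma concrete. Take $\rho$ with eigenvalues $p_0$ and $1 - p_0$, and measure it in a basis rotated by an angle $\theta$ relative to its eigenbasis, so that $G = \begin{pmatrix} \cos^2\theta & \sin^2\theta \\ \sin^2\theta & \cos^2\theta \end{pmatrix}$. The sketch below computes the induced distribution and its entropy; the function names and the choice of a real rotation are illustrative assumptions.

```python
import math


def binary_entropy(x: float) -> float:
    """Shannon entropy (base 2) of the distribution (x, 1 - x)."""
    return -sum(q * math.log2(q) for q in (x, 1 - x) if q > 0)


def measured_probability(p0: float, theta: float) -> float:
    """Probability q_0 = p_0 cos^2(theta) + (1 - p_0) sin^2(theta)."""
    c2 = math.cos(theta) ** 2
    return p0 * c2 + (1 - p0) * (1 - c2)
```

Since $q_0$ is a convex combination of $p_0$ and $1 - p_0$, it lies at least as close to 1/2 as $p_0$ does, so `binary_entropy(measured_probability(p0, theta))` is never smaller than `binary_entropy(p0)`, with equality at $\theta = 0$.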

Lemma: Let $\rho_i$ be density matrices of the same number of qubits. Let $p_i$ be some distribution
on these matrices. Then $S(\sum_i p_i \rho_i) \ge \sum_i p_i S(\rho_i)$.

Proof: Let us interpret the diagonal terms in a density matrix $\rho$ as a classical probability distribution
D. We have that in any basis $H(D) \ge S(\rho)$, where equality is achieved if and only if $\rho$ is
written in the basis which diagonalizes it. Let us write all matrices in the basis which diagonalizes the
matrix $\rho = \sum_i p_i \rho_i$, and let the diagonal terms of $\rho$ be the distribution D and the diagonal terms of $\rho_i$
be the distributions $D_i$, respectively. We then have $S(\rho) = H(D) = H(\sum_i p_i D_i)$, where the sum is in
each coordinate, and by the concavity of classical entropy we have that $H(\sum_i p_i D_i) \ge \sum_i p_i H(D_i)$.
Since $H(D_i) \ge S(\rho_i)$, this completes the proof. ∎

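Lemma: $S(\rho_1) + S(\rho_2) = S(\rho_1 \otimes \rho_2)$.

Proof: If $\{\lambda^1_i\}$, $\{\lambda^2_j\}$ are the sets of eigenvalues of $\rho_1, \rho_2$ respectively, the eigenvalues of $\rho_1 \otimes \rho_2$ are
just $\{\lambda^1_i \lambda^2_j\}$, and the entropy is $S(\rho_1 \otimes \rho_2) = -\sum_{i,j} \lambda^1_i \lambda^2_j \log(\lambda^1_i \lambda^2_j) = S(\rho_1) + S(\rho_2)$. ∎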

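Lemma: Let $\rho_1, \rho_2$ be two density matrices of the same number of qubits. The relative entropy
of $\rho_1$ with respect to $\rho_2$ is non negative: $S(\rho_1|\rho_2) \ge 0$.

Proof: Let $\{|v^1_m\rangle\}$, $\{|v^2_m\rangle\}$ be the eigenvectors of $\rho_1, \rho_2$ respectively, and $\{\lambda^1_m\}$, $\{\lambda^2_m\}$ be the
corresponding eigenvalues. We can write $\log(\rho_1) = \sum_m \log(\lambda^1_m) |v^1_m\rangle\langle v^1_m|$. We want to evaluate
the relative entropy $S(\rho_1|\rho_2) = \mathrm{Tr}[\rho_2(\log_2(\rho_2) - \log_2(\rho_1))]$ in the basis $\{|v^2_m\rangle\}$ where $\rho_2$ is
diagonal. The diagonal elements of $\log(\rho_1)$ in this basis are

$\log(\rho_1)_{m,m} = \sum_n \log(\lambda^1_n) \langle v^2_m|v^1_n\rangle\langle v^1_n|v^2_m\rangle = \sum_n \log(\lambda^1_n) |\langle v^1_n|v^2_m\rangle|^2$,

so

$S(\rho_1|\rho_2) = \sum_m \lambda^2_m \log(\lambda^2_m) - \sum_{m,n} \lambda^2_m |\langle v^1_n|v^2_m\rangle|^2 \log(\lambda^1_n) = \sum_{m,n} \lambda^2_m |\langle v^1_n|v^2_m\rangle|^2 \log(\lambda^2_m/\lambda^1_n)$,

where we have used $\sum_n |\langle v^1_n|v^2_m\rangle|^2 = 1$. Since $\log(x) \ge 1 - \frac{1}{x}$,

$S(\rho_1|\rho_2) \ge \sum_{m,n} \lambda^2_m |\langle v^1_n|v^2_m\rangle|^2 (1 - \lambda^1_n/\lambda^2_m) = 0$. ∎
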

Let us prove another fact which will be needed.

Lemma 9 Let $\rho_1$ be a reduced density matrix of $\rho$ to a subsystem A. Then $-\mathrm{Tr}(\rho \log(\rho_1 \otimes I)) = -\mathrm{Tr}(\rho_1 \log(\rho_1)) = S(\rho_1)$.

Proof: Let us write everything in a basis for the whole system, where $\rho_1$ is diagonal. In this basis
$\log(\rho_1 \otimes I)_{in,in} = \log((\rho_1)_{i,i})$, where the first index (i or j) indicates qubits in A, and the second index
indicates the qubits which we disregard and trace over. Then
$-\mathrm{Tr}(\rho \log(\rho_1 \otimes I)) = -\sum_{i,n} \rho_{in,in} \log((\rho_1)_{i,i}) = -\sum_i (\rho_1)_{i,i} \log((\rho_1)_{i,i}) = S(\rho_1)$,
where we have used the fact that $\rho_1$ is a reduced density matrix, which
satisfies $(\rho_1)_{i,j} = \sum_n \rho_{in,jn}$. ∎

Lemma 6: Let g be a quantum gate, $\rho$ a density matrix. Then $I(g \circ \rho) \le I(\rho)$.

Proof: The entropy is invariant under unitary transformations, since a unitary transformation
changes the eigenvectors but does not change the set of eigenvalues, which is what determines the
entropy of the density matrix. Therefore the information does not change if g is unitary. If g is a
measurement gate, let us write $\rho$ and $g \circ \rho$ in a basis of eigenvectors of the extension of the observable g, in
which $g \circ \rho$ is diagonal. By the previous lemma on relative entropy, the relative entropy of $g \circ \rho$ with respect to $\rho$ is non negative.
Writing the relative entropy in the basis of eigenvectors:
$0 \le S(g \circ \rho|\rho) = \mathrm{Tr}[\rho(\log_2(\rho) - \log_2(g \circ \rho))] = -S(\rho) - \sum_m \rho_{m,m} \log((g \circ \rho)_{m,m}) = -S(\rho) + S(g \circ \rho)$,
where the last equality is due to the fact that
in this basis $\rho_{m,m} = (g \circ \rho)_{m,m}$. Hence $n - S(g \circ \rho) \le n - S(\rho)$. ∎

Lemma 7: Let $\rho$ be a density matrix of n qubits, and let k < n. Then $\frac{1}{\binom{n}{k}} \sum_{A_k} I(\rho|_{A_k}) \le \frac{k}{n} I(\rho)$.

Proof: Let $\rho_1$ be the tensor product of $k\binom{n}{k}$ copies of $\rho$, and let $\rho_2$ be the tensor product
of all the possible reduced $\rho|_{A_k}$, each taken n copies. $\rho_1$ and $\rho_2$ are matrices of an equal number of
qubits. Hence we can use the non negativity of the relative entropy, and write

$0 \le S(\rho_2|\rho_1) = \mathrm{Tr}(\rho_1(\log(\rho_1) - \log(\rho_2))) = -S(\rho_1) - \mathrm{Tr}(\rho_1 \log(\prod_{A_k} (\rho|_{A_k})^n)) = -k\binom{n}{k} S(\rho) - \sum_{A_k} \mathrm{Tr}(\rho_1 \log((\rho|_{A_k})^n \otimes I))$,

where all products and powers of matrices are understood as tensor
products, and using the fact that the logarithm of tensor products can be written as the sum of
logarithms. We now observe that $(\rho|_{A_k})^n$ is a reduced density matrix of $\rho_1$ if k > 0. Lemma 9 and the
additivity of the entropy under tensor products
imply that $0 \le -k\binom{n}{k} S(\rho) + \sum_{A_k} n S(\rho|_{A_k})$, so for k > 0, $\sum_{A_k} I(\rho|_{A_k}) \le \frac{k}{n}\binom{n}{k} I(\rho)$. ∎